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Abstract 

The arginine binding protein from Thermatoga maritima (TmArgBP), a substrate binding protein (SBP) involved in tlie ABC 
system of solute transport, presents a number of remarl<able properties. Tliese include an extraordinary stability to 
temperature and chemical denaturants and the tendency to form multimeric structures, an uncommon feature among SBPs 
involved in solute transport. Here we report a biophysical and structural characterization of the TmArgBP dimer. Our data 
indicate that the dimer of the protein is endowed with a remarkable stability since its full dissociation requires high 
temperature as well as SDS and urea at high concentrations. In order to elucidate the atomic level structural properties of 
this intriguing protein, we determined the crystallographic structures of the apo and the arginine-bound forms of TmArgBP 
using MAD and SAD methods, respectively. The comparison of the liganded and unliganded models demonstrates that 
TmArgBP tertiary structure undergoes a very large structural re-organization upon arginine binding. This transition follows 
the Venus Fly-trap mechanism, although the entity of the re-organization observed in TmArgBP is larger than that observed 
in homologous proteins. Intriguingly, TmArgBP dimerizes through the swapping of the C-terminal helix. This dimer is 
stabilized exclusively by the interactions established by the swapping helix. Therefore, the TmArgBP dimer combines a high 
level of stability and conformational freedom. The structure of the TmArgBP dimer represents an uncommon example of 
large tertiary structure variations amplified at quaternary structure level by domain swapping. Although the biological 
relevance of the dimer needs further assessments, molecular modelling suggests that the two TmArgBP subunits may 
simultaneously interact with two distinct ABC transporters. Moreover, the present protein structures provide some clues 
about the determinants of the extraordinary stability of the biomolecule. The availability of an accurate 3D model represents 
a powerful tool for the design of new TmArgBP suited for biotechnological applications. 
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Introduction 

Structural plasticity is a fundamental feature of proteins. Despite 
the paradigm that associates protein sequences to weU-defined 3- 
drmensional structures [1], it is commonly accepted that proteins 
are often endowed with repertoires of distinct structural states. 
Protein structural transitions play major roles in several biological 
processes, including protein-protein recognition, protein-Ugand 
binding, and signalling. 

The mechanism of small molecule transport across biological 
membranes represents one of the most striking examples that 
highlights the role of protein flexibility in cellular processes. This 
transport is generally carried out by intricate systems in which 
dynamical events are essential for the transfer of information from 
one protein component to the other. Prototypical examples in this 
context are represented by the ATP-binding cassette (ABC) 
systems that play a fundamental role in the import of essential 
nutrients and in the export of toxic molecules in bacteria [2]. 



Canonical ABC cassette systems share a common structural 
organization comprised of two transmembrane domains (TMDs) 
that form the translocation pore and two nucleotide-bindiiig 
domains (NBDs) that hydrolyze ATP. The action of these systems 
generally depends on the presence of extra-cytoplasmic ancillary 
proteins, denoted as substrate binding proteins (SBPs), which 
recognize substrates with high affinity and deliver them to the 
TMD domains [2] . Interestingly, the structural characterization of 
the proteins involved in this process has revealed that these 
intricate molecular machines use complex dynamic mechanisms to 
fulfil their functions [3,4]. Indeed, SBPs undergo large structural 
rearrangement upon substrate binding according to the so-called 
Venus fly-trap mechanism [5] . The interaction of the substrate- 
bound form of SBPs with the periplasmic peptide regions of the 
TMDs of the cognate ABC transporter, initiates the transport 
process. Although the atomic details of this cascade of events are 
not fully understood, it is commonly accepted that the entire 
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STM4351 S. enterica (2Y7I) TS 13 

AbpA S. pneumonia (2162) KLWALN 46 

DSM 1740 W. succinogenes (3K4U) ELRVGLE 12 



TmArgBP (PDB code 4PRS) 
TTC0807 (WF) 

ArtJ G . stearothermophilus (2Q2C) 
STM4351 S. enterica (2Y7I) 
AbpA S. pneumonia (2162) 
DSM 1740 W. succinogenes (3K4U) 



ADFPPFE FVDE NGNIVGFDVDLAKE lARRLGVELKIVDMTFDGLl PS 82 

GAFPPFNYFDE RNQLTGFEVDLGNAIAERLGLKPRWIAQSFDTLLIQ 90 

AAFAPFEYMQ KGKIVGFDVDLLDAVMKAAGLDYELKNIGWDPLFAS 95 

ATYAPYEFVDA DNKIVGFDIDVANAVCKEMQAECSFTNQSFDSLIPS 60 

PDFAPFEYQKWDGKNQIVGSDIELAKAIATELGVELELSPMSFDNVLAS 96 

PGYLPFEMKDK KGNVIGFDVDLAREMAKAMGVKLKLVPTSWDGLIPG 59 



TmArgBP (PDB code 4PRS) 
TTC0807 (WF) 

ArtJ G . stearothermophilus (2Q2C) 
STM4351 S. enterica (2Y7I) 
AbpA S. pneumonia (2162) 
DSM 1740 W. succinogenes (3K4U) 



LLTKKIDVIISGMTITEERKKWAFSDPYFDAGQVIWRKDSDFRPKTYE 132 
LNQGRFDFVIASHGITEEFSARAVDFTNPHYCTGGVIVSRKG — GPRT-AK 137 
LQSKEVDMGISGITITDEPSKQSYDFSDPYFEATQVILVKQG — SPVKNAL 143 
LRFKKFDAVIAGMDMTPKREQQVSFSQPYYEGLSAWVTRK — GAYHTFA 108 
VQSGBCADLAISGVSKTDERSKVFDFSTPYYTAKNKLIVKKSDLATYQSVN 14 6 
LVTEKPD III SGMTISQERNLRVNFVE PYIWGQSLLVKKGLEKGVKS YK 109 



TmArgBP (PDB code 4PRS) 
TTC0807 (WF) 

ArtJ G . stearothermophilus (2Q2C) 
STM4351 S. enterica (2Y7I) 
AbpA S. pneumonia (2162) 
DSM 1740 W. succinogenes (3K4U) 



DLVGK — TVAVQIGTTGDIEVSKYDG--IKWRFDKFTDAFLELKRGRAD 178 

DLQGK — WGVQVGTTYMEAAQKIPG-IKEVRTYQRDPDALQDLLAGRID 184 

DLKGK — TIGVQNATTGQEAAEKLFGKGPHIKKFETTWAIMELLNGGVD 191 

DLKGK — KVGLENGTTHQRYLCDKQQ-AI TPVAYDSYLNAFTDLKNNRLE 155 

DLAQK — KVGAQKGSIQETMAKDLLQ-NSSLVSLPKNGNLITDLKSGQVD 193 

DLDKPELTLVTKFGVSAE YAAKRLFK-NAKLKTYDTEAEAVQEVIiNGKAD 158 
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AbpA S. pneijmionia (2162) 
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AWLDSATARAFVAKNP DLVISSGVLSSEQYGIAVRKEDTDLLEF 223 

TWI TDRFVAKEAIKER KLENTLQVGELVFQERVAMAVAKGNKSLLDA 231 

AVITDNAVANEYVKNNP-NKKLQVIEDPKNFASEYYGMIFPK-NSELKAK 239 
GVFGDVAAIGKWLKNNPDYAIMDERASDPDYYGKGLGIAVRKDNDALLQE 205 
AVIFEEPVAKGFVENNPDLAIADLNFEKE — QDDSYAVAMKKDSKELKEA 241 
MFIFDLPFNVAFMAQKG QGYLVHLDTSLTYEPLGWAIKKGDPDFLNW 205 



TmArgBP (PDB code 4PRS) INSVLRELKKSP-YDVLIEKWFSE 246 

TTC0807 (WF) LNRALAELMQDGTYARISQKWFGEDVRCK 260 

ArtJ G. stearothermophilus (2Q2C) VDEALKNVINSGKYTEIYKKWFGKEPKLDRLKQ- 272 

STM4351 S. enterica (2Y7I) INAALDKVKASPEYAQMQEKWFTQ 22 9 

AbpA S. pneumonia (2162) VDKTIQKLKESGELDKLIEDAFKASIEK 269 

DSM 1740 W. succinogenes (3K4U) IiNHFLAQIKHDGSYDELYERWFVDTKWLEKIQEG 239 



Figure 1. Multiple sequence alignment of TmArgBP with known homologues. For each protein the PDB code is reported in parenthesis. 
The signal peptide of TmArgBP is drawn in red. For the other proteins only the binding domain is reported. 
doi:1 0.1 371 /journal.pone.0096560.g001 



process relies on large conformational transitions of both TMD 
and NBD moieties of the ABC systems. 

The ability of SBPs associated with the ABC transport 
machinery to bind a variety of different ligands and their intrinsic 
structural versatility have made them very attractive systems for 
the development of platforms based on fluorescent protein 
biosensors for many naturally-occurring ligands. Indeed, by ad 
hoc re-engineering of their binding pockets it is possible to generate 
proteins able to recognize specific analytes for which sensors are 
eagerly needed. 

As ideal biosensors are expected to have remarkable stability, 
we turned our attention to proteins isolated from thermophilic 
organisms. We identified the arginine binding protein form 
Thermatoga maritima (TmArgBP) as an ideal system for arginine 
detection [6,7,8,9,10]. Arginine sensing is extremely important 
since argininemia is a debilitating inherited condition, charac- 
terized by a gradual accumulation of arginine and ammonia in 
the blood, whose diagnosis is crucial for effective medical 
intervention [11]. The biochemical and biophysical character- 
ization of the protein has shown that TmArgBP presents a 
number of remarkable properties. These include an extraordi- 
nary stability to both temperature and chemical denaturants [9]. 
Additionally, in contrast to the vast majority of SBP that 
operate as monomers, TmArgBP forms multimeric assemblies at 
room temperature [7,9]. 

In order to elucidate the atomic level structural properties of this 
intriguing protein, we determined the crystaUographic structures 



of both the apo and the arginine-bound forms of the protein. The 
analysis of these structures reveals some unexpected features and 
provides a solid structural framework for interpreting the 
biochemical and biophysical properties of the protein. The 
availability of accurate three-dimensional models for the different 
states of the protein strongly facilitates the design of variants that 
can act as durable and highly specific sensors for arginine. 

Results 

Detection and stability of TmArgBP dimer in solution 

TmArgBP is comprised of 246 amino acid residues, including a 
periplasmic signal localization peptide at its N-terminus. The 
analysis of the protein sequence performed using the TMHMM 
server (http://www.cbs.dtu.dk/services/TMHMM/) clearly indi- 
cates that residues 4 to 2 1 form a trans-membrane helix (Figure S5 
in File SI). The first three residues are expected to be outside the 
membrane whereas the periplasmatic portion corresponds to the 
residues 22-246 (Figure 1). We expressed and purified the full- 
length protein deprived of the membrane localization signal 
(residues 20-246), in a soluble and homogeneous form that was 
amenable for both solution and crystallography studies. 

The oligomerization state of the protein was initially charac- 
terized by performing gel filtration and light scattering measure- 
ments (Figure 2). In line with previous reports [7,9], gel filtration 
analysis indicates that the protein adopts different aggregation 
states (data not shown). Although aggregates with higher 
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Figure 2. Analytical SEC-MALS of TmArgBP. The blacl< and grey curves represent the Rayleigh ratio (left scale) of HoloTmArgBP and 
ApoTmArgBP, respectively; both are plotted against the retention time. Molecular masses are reported with the same colour code. In both 
experiments, average molecular masses values correspond to a dimeric state of the protein. 
doi:1 0.1 371/journal.pone.0096560.g002 



molecular masses were detected, we focused our attention on the 
main component. The analysis of this component by light 
scattering provided a weight-average molar mass of 50.3±0.2 
and 49.6±0.1 kDa, for HoloTmAi-gBP and ApoTmArgBP, 
respectively. Taking into account the theoretical mass of the 
TmArgBP monomer (25267 Da), these data clearly indicate a 
dimeric organization of both the apo and the holo forms in 
solution (Figure 2). We also observed that this dimer does not 
undergo dissociation or further aggregation with time. 

Native gel electrophoresis experiments confirms the presence of 
a single predominant species, likely the dimer, which is stable in 
the presence of 4 M urea (Figure 3A). In order to gain further 
insights into the stability of the TmArgBP dimer, the protein 
was heated in the presence denaturants (Figure 3). In particular. 
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Figure 3. Stability of the TmArgBP dimer. (A) Native PAGE 
electrophoresis of HoloTmArgBP and ApoTmArgBP. Lanes 1 and 2 
contain HoloTmArgBP and ApoTmArgBP, respectively. The same 
experiments were carried out (Lanes 3 and 4) in the presence of 4IV1 
urea. (B) SDS PAGE upon treatment of Holo-TmArgBP and (C) Apo- 
TmArgBP with increasing urea concentrations. Lanes 1 and 2 contain 
urea concentrations 0 and 8 M, respectively. The same markers were 
used in the two experiments. 
doi:1 0.1 371/journal.pone.0096560.g003 



SDS-PAGE (15%) was performed after heating the protein at 
100°C for 5 minutes and indicates the coexistence of the dimeric/ 
monomeric forms for both ApoTmArgBP and HoloTmArgBP. 
Interestingly, the dimer is fully dissociated only with the addition 
of urea at high concentrations (Figure 3B, G). However, the 
dissociation of the dimer is reversible, since this form can be 
restored upon urea dilution (data not shown). These results 
demonstrate that the dimeric organization of TmArgBP is 
endowed with a remarkable stabUity, which is independent of 
the binding state of the protein. The purified dimeric form of 
TmArgBP was used in the subsequent structural characterization. 

The overall structure of ApoTmArgBP 

All attempts to solve the structure of the apo/holo forms of 
TmArgBP using molecular replacement were unsuccessful. This 
was likely due to the large unit cell of the holo form and to the high 
flexibility of the apo form. Therefore, a seleno-methionine 
derivative of TmArgBP was prepared to set up anomalous 
dispersion experiments. The structure of the apo form was solved 
by performing multi-wavelength anomalous diffraction experi- 
ments, collecting data at three distinct wavelengths (Table 1). 

The structure solution unveils the presence of two independent 
polypeptide chains in the asymmetric unit of the ApoTmArgBP 
crystals. An analysis of the intermolecular contacts established by 
ApoTmArgBP molecules within the crystals, carried out by using 
the server PISA, clearly indicates that the these two independent 
molecules constitute a stable dimer. Indeed, the total buried area 
in the complex is 1387 A^. This finding fully agrees with the 
characterization of the ApoTmArgBP oligomeric state in solution 
(see above), (residues 23-110 and 210-231) and lobe II (residues 
116-203). 

Similar to other arginine-binding proteins, ApoTmArgBP is 
folded into two distinct lobes, each consisting of a P-sheet core 
formed by five strands, surrounded by helices, ^-sheets of lobes I 
(residues 23-1 10 and residues 210-231) and II (residues 1 16-203) 
present topologies (32P1P3PSP4 and PsPvPgPePioj respectively. The 
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Table 1. Data collection statistics. 







SeMet derivative Holo 


SeMet derivative Apo 








Peak 


Peak 


Inflection point 


Remote 


Beamline 


X12 


XI 2 


X12 


XI 2 


Space group 


P6,22 


C2 


a 


C2 


Unit cell parameters 


a (A) 


79.79 


116.81 


117.1 


116.75 


b (A) 


79.49 


51.97 


51.92 


51.96 


c (A) 


434.1 1 


99.00 


99.15 


99.00 


P. y (°) 


120.0 


122.8 


122.9 


122.8 


Resolution range (A) 


50.00-2.49 


50.00-1.47 


50.00-1 .46 


50.00-1.50 


Wavelength (A) 


0.9799 


0.9796 


0.9894 


0.9537 


Average redundancy 


15.0 (3.2) 


2.9 (2.1) 


4.4 (2.8) 


3.1 (2.2) 


Unique reflections 


29302 


82697 


85761 


77301 


Completeness (%) 


97.46 (75.5) 


97.3 (81.6) 


98.6 (87.3) 


97.6 (84.2) 


Rmerge (%) 


7.7 (15.1) 


4.9 (36.5) 


5.2 (39.4) 


4.5 (41 .7) 


Average I/<t(I) 


29.0 (3.5) 


18.9 (1.8) 


21.0 (1.9) 


16.8 (1.7) 



Values in parentheses are for higher resolution shells (2.53-2.49 A, 1.52-1.47 A, 1.51-1.46 A and 1.44-1.50 A for Holo-pk, Apo-pk, Apo-ip and Apo-rm, respectively). 
doi:1 0.1 371 /journal.pone.0096560.t001 



two lobes are connected by two distinct segments (residues 111- 
115 and 204-209), a characteristics typical of class II SBPs [12]. In 
ApoTmArgBP, the two connecting segments form a short two- 
stranded |3-sheet, which is stabilized by three main chain hydrogen 
bonds. In line with unliganded SBPs, the two lobes of the proteins 
are far away from each other and do not establish significant non- 
covalent interactions. 

The analysis of the protein N- and C-termini unveils interesting 
and unexpected features. The very N-terminal residues of the 
construct used in the present study (residues 20-27) form a stable 
a-helix. This suggests that the transmembrane helical region 
(residues 4—21) extends in the periplasm maintaining its structure. 
The limited number of contacts of this helix with the rest of the 
protein suggests that this region is free to adopt alternative states 
upon the anchoring of the signal peptide to the membrane. The 
analysis of the C-terminus indicates that TmArgBP dimerization is 
caused by its 3D domain swapping (Figure 4A). Indeed, the two 
subunits in the dimer mutually exchange their C-terminal helix 
(residues 237-246). An important role in the swapping process is 
played by the hinge region 232-KKSPY-236 whose extended 
conformation hampers the association of the terminal helix with its 
own subunit and favors its anchoring the main body of the other 
polypeptide chain (Figure 4B). A comparison of the local sequence 
of the hinge region with those of other arginine binding proteins 
with known 3D structures (Figure 1) suggests that it is more 
conformationally restrained in TmArgBP. Indeed, the hinge 
region of TmArgBP presents a deletion that shortens the loop 
length, which is associated with the concomitant presence of a Pro 
residue. It is reasonable to assume that this specific motif is 
responsible for the domain swapping observed in TmArgBP. 

According to the notation introduced by Eisenberg et al. [13] 
different contact surfaces may be identified in a 3D-swapping 
protein, as evidenced in Figure 4. The C-interface (closed 
interface) is the contact area formed by the swapping fragment 
with the main body of the other chain. This is expected to be 
present in the non-swapping monomer of the same or similar 
monomeric proteins. On the other hand, the O-interface (open 
interface) exclusively occurs in a 3D domain-swapped dimer and is 



not present in the closed monomeric form. In this framework, the 
C-interface of the ApoTmArgBP dimer reproduces the contacts 
between the C-terminal helix and the rest of the protein. This 
interface is stabilized by a strong salt bridge formed by the side 
chains of Asp56 of chain A with Lys242 of chain B (and viceversd). A 
weaker electrostatic interaction is formed by the side chains of 
Lysl93 of one chain and Glu241 of the other chain. In addition, a 
number of hydrophobic interactions are detected at the C- 
interface. In particular, close contacts are established by the side 
chains of Tyr236, Trp243, Phe244, Leu239 and Ile240 from the 
C-terminal helix of one monomer with a concave cavity of the 
adjacent monomer, formed by Phe53, Leu57, and Phell2 
(Figure 4C). On the other hand, only sporadic interactions are 
detected at the O-interface (Figure 4B). In contrast to the majority 
of assemblies characterized by domain swapping, the main bodies 
of the two subunits do not form any specific interaction. The only 
new interactions formed upon the formation of the swapped dimer 
are those involving the hinge peptide. A strong hydrogen bond 
between the two subunits is formed by the N main chain atom of 
Tyr236 of one chain with the oxygen atom of Leu231 of the other 
chain (and viceversd). In addition, the OH atom Tyr236 of the hinge 
region forms a hydrogen bond with the backbone oxygen atom of 
Tyrl 1 1. Therefore, the swapping dimer is essentially stabilized by 
interactions formed at the C-interface. Finally, it is worth noting 
that the two independent molecules of ApoTmArgBP display 
significant differences at the level of tertiary structure (Figure 4D). 
This indicates that this form is endowed with a significant 
flexibility (see also below). 

The crystal structure of HoloTmArgBP: molecular basis of 
arginine recognition by HoloTmArgBP 

In order to determine the structural basis of arginine recognition 
by TmArgBP we also determined the crystal structure of the 
bound form of the protein. We initially characterized the 
recombinant protein without adding any external amino-acid. 
For this form, a readily interpretable electron density throughout 
the entire structure was obtained by using the Single-wavelength 
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Figure 4. Domain-swapped dimer of ApoTmArgBP. (A) Cartoon representation of ApoTmArgBP swapping dimer. (B) Omit (Fo-Fc) map of the 
hinge region, contoured at 2a. (C) Interactions mediated by the C-terminal helix. (D) Superposition of the chains A and B of ApoTmArgBP. 
doi:1 0.1 371 /journal.pone.0096560.g004 



Anomalous Dispersion method, (Table 2). Although crystallization 
was performed without adding exogenous arginine, the inspection 
of maps corresponding to the putative arginine binding pocket 
clearly unveiled the presence of an elongated electron density 
corresponding to an arginine (Figure 5). This is indicative of the 
tight affmity of TmArgBP for arginine at room temperature. 

The crystals of HoloTmArgBP also contain two molecules in 
the asymmetric unit. The two molecules are virtually identical. 

Table 2. Refinennent statistics. 



with a root mean square deviation (RMSD) value, calculated on 
backbone atoms, of 0.9 A. Similar to the apo form, these two 
molecules form a tight dimer, which buries an area of978 The 
inspection of the electron density maps clearly shows that dimer 
formation occurs through swapping of the C-terminal helix, with 
the hinge region located between Lys232 and Tyr236 (Figure 6). 
The C- and the O-interfaces of the holo form are, in terms of H- 
bonds and hydrophobic interactions, virtually identical to those 







HoloTmArgBP 


ApoTmArgBP 


Resolution range (A) 


1 5.00-2.49 


15.00-1.47 


Asymmetric unit R (%) 


Dimer 17.8 


Dimer 14.9 


Rfree (%) 


22.8 


20.0 


No. of residues 


452 


454 


No. of ligand molecole 


2 


0 


No. of water molecules 


470 


820 


Mean B value (A^) 


303 


21.1 


R.m.s. deviations 


Bond lengths (A) 


0.018 


0.017 


Bond angles (°) 


1.9 


1.8 



Values in parentheses are for higher resolution shells (2.53-2.49 A and 1.52-1.47 A for Holo and ApoTmArgBP, respectively). 
doi:1 0.1 371/journal.pone.0096560.t002 
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Figure 5. (2Fo-Fc) electron density map contoured around tKie 
arginine ligand (2.0 a). Arginine interacting residues are 
KiigliligKited. 

doi:1 0.1 371 /journal.pone.0096560.g005 

detected for ApoTmArgBP. As a result, the formation of the 
interface causes, according to PISA [14], a strong gain of free 
energy of solvation (AG= —21.1 kcal/mol). 

The inspection of the TmArgBP binding pocket shows that the 
protein anchors arginine through a variegate ensemble of 
interactions that include hydrogen bonds, salt bridges and 
hydrophobic interactions. These interactions cage the arginine 
ligand by tightly anchoring both its backbone and the side chain, 
thus making it fully solvent inaccessible. Backbone interactions 
involve a bifurcated salt bridge between the carboxyl end of 
arginine and the side chain of ArglOl and a salt bridge between 
the backbone nitrogen and the side chain of Asp 183 (Figure 5). 
The aliphatic portion of arginine side chain is sandwiched between 
the two aromatic rings of Phe38 and Phe76, whereas the 
guanidine group is involved in salt bridges with Asp37 and 
Glu42, as well as hydrogen bonds with the side chains of Gin 142 
and with Serl31 (Figure 5). The binding of arginine ligand to 
ApoTmArgBP was quantified using Isothermal Titration Calo- 
rimetry (ITC) (Figure 7). Binding isotherms for the interaction of 
ApoTmArgBP with arginine, measured at pH 8.0, were charac- 
terized by exothermic heats of binding which decreased in 
magnitude with successive injections until saturation was achieved 
(Figure 7A). Consistent with the several interactions observed in 
the crystal structure of TmArgBP complex, our data indicate a 
strong enzyme-inhibitor binding, with Kjj in the low nanomolar 
range (1.3±0.9 nM). On the other hand, we observed that 
ApoTmArgBP is unable to bind glutamine (Figure 7B). This result 
suggests that the salt-bridge interactions with Asp56 and GluGl 
established by the guanidine group of Arg are essential for amino 
acid binding. 

ApoTmArgBP vs HoloTmArgBP: tertiary and quaternary 
structure variations 

The comparison of the two independent molecules present in 
the structures of ApoTmArgBP and HoloTmArgBP indirectly 
indicates that the two forms are endowed with distinct flexibility at 
tertiary structure level. The two molecules of the unliganded forms 
display significant differences whereas the holo molecules are 
virtually unchanged. Using the program DynDom [15], we 
computed a difference in the closure angle between lobe I and 
lobe II of 20° for the apo form (Figure 4D), whereas no dilference 
was detected for the holo form. These results show that the apo 
form is endowed with a larger intrinsic flexibility. 




Figure 6. Swapping dimer of IHoioTmArgBP. (A) Cartoon 
representation of HoloTmArgBP domain-swapped dimer. (B) Omit (Fo- 
Fc) map of the hinge region contoured at 2a. 
doi:10.1371/journal.pone.0096560.g006 

Although the basic secondary structure elements are preserved 
in the apo and holo forms of the protein, huge variations are 
observed at tertiary structure level. Indeed, RMSD values, 
calculated on the Ca atoms of each monomer are as high as 8.4 
A. The comparison of the tertiary structure of the liganded and 
unliganded structure demonstrates that the binding of the arginine 
ligand brings the two lobes together. The remarkable domain 
closure in HoloTmArgBP requires rotations of lobe II towards 
lobe I of 82°-84° starting from the two monomers of ApoT- 
mArgBP (Figure 8A). Consistendy, when single lobes of the two 
forms are overlapped, RMSD values drop to 0.5 A for both lobes. 

The observed variations of the tertiary structures of ArgBP upon 
arginine binding have a strong effect on the protein quaternary 
structure (Figure SB). Interestingly, ApoTmArgBP, which is 
characterised by a less compact and more flexible tertiary 
structure, presents a more compact quaternary structure. Indeed, 
the C-terminal helix of Apo-TmArgBP forms interactions with 
residues at the closed interface between the two lobes of the 
adjacent monomer (Figure 4), an interface which is not accessible 
in the Holo form, since locked by the arginine ligand. Consistently, 
the formation of the apo dimer covers a larger interface area than 
observed for HoloTmArgBP (1383 versus 978 A^), albeit with a 
similar gain of the free energy of solvation. 

Discussion 

The determination of the three-dimensional structure of the apo 
and holo forms of TmArgBP reveals, along with predictable 
features, some unexpected findings. The secondary structural 
elements and protein organization in distinct subdomains (lobes) 
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Figure 7. Isothermal titration calorimetry experiments with (A) arginine and (B) glutamine. Top panels report raw data for tine titrations 
at 25"C, whereas bottom panels report integrated heats of binding obtained from the raw data after subtracting the heats of dilution. The solid line 
(in A) represents the best curve fit to the experimental data using the 'one set of sites' model from MicroCal Origin. 
doi:1 0.1 371 /journal.pone.0096560.g007 



connected by two joining fragments confirm that TmArgBP is a 
type II SBP. In addition, the length of these joining regions 
classifies TmArgBP among cluster F according to the definition 
proposed by Poolman and co-workers [12]. The tertiary structure 
closure associated with the arginine binding is well described by 
the Venus Fly-trap mechanism observed for other SBPs [5,12]. 
The comparison of the apo and the holo forms of TmArgBP shows 
that the structural re-organization upon substrate binding is 
associated with a domain movement as large as 80°. 

One novel element emerged from the present study is that the 
protein forms a dimer through the swapping of the C-terminal 
helix. This dimer is characterized by rather loose quaternary 
structure organization (Figures 4 and 6). Indeed, the two subunits 
are essentially held together by interactions formed by the 
swapping C-terminal helices. In contrast, the main bodies (residues 
20-233) of the two chains do not estabhsh significant interactions. 
This feature suggests that both the apo and the holo forms are 
endowed with a remarkable flexibility at the quaternary level. 
Molecular dynamics simulations carried out on the swapping 
dimers of RNase A have clearly indicated that interactions formed 
by the main bodies of the proteins play a crucial role in dictating 
the overall flexibility of these assemblies [16]. Indeed, the RNase A 
C-terminal swapping dimer, which presents a loose interface [17], 
is highly flexible [16] whereas the N-terminal swapping dimer 
presents a tight association of the two subunits and is rather rigid 
[18,19]. In this scenario, it is likely that the dynamic behavior of 
both ApoTmArgBP and HoloTmArgBP resembles the one 
observed for RNase A C-terminal swapping dimer. It is important 
to note that solution studies presented here clearly show that the 
peculiar stability of TmArgBP is not confined to the individual 
subunits of the protein but also extends to the ohgomeric 
association. Therefore, domain swapping confers TmArgBP a 
combination of plasticity and stability that is not achievable in 



canonical (non-swapped) dimeric associations. Moreover, the 
preservation of the swapping in both the apo and the holo forms 
makes TmArgBP structure an unusual example of the preservation 
of domain swapping despite giant variations of the tertiary 
structure. It is interesting to note that TakP, a unrelated SBP 
from a TRAP transporter, also dimerizes through the swapping of 
the C-terminal helix [20] . In this case, however, tertiary structure 
variations associated with ligand binding are very limited. The 
oligomerization through domain swapping exhibited by TmArgBP 
also provides a rationale for the observed ability of the protein to 
form higher aggregates [7,9]. These states may be achieved 
through a mutual swapping of the C-terminal helix that is not 
limited to two subunits but involves three (trimers) or four 
(tetramers) protein molecules, as found in RNase A [13,21,22]. 

The observation that TmArgBP dimerizes through the 
exchange of the C-terminal helix leads us to question the sequence 
determinants of swapping. Literature studies have shown that, 
among other factors, the presence of Pro residues and/or the 
occurrence of deletions in the hinge regions favor the swapping via 
a destabHization of the monomeric form [23,24]. The analysis of 
TmArgBP hinge sequence shows the occurrence of a deletion that 
is concomitant with the presence of a Pro residue (Pro236). This 
feature is not found in other ArgBPs with a known 3D structure 
(Figwe 1). An analysis of protein sequences indicates that 
TmArgBP shares a similar hinge sequence only with very close 
homologs. Indeed, the most distant protein (NCBI Reference 
Sequence YP_00 1305722) that presents the same deletion and the 
Pro residue in the hinge peptide show a sequence identity with 
TmArgBP of 60%. In evolutionary terms, this may suggest that the 
propensity of this protein to dimerize through domain swapping is 
a recently acquired property. 

It is well know that the vast majority of SBPs operate as 
monomers as only sporadic examples of dimeric SBPs [12], 
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Figure 8. Variation of tertiary (A) and quaternary (B) structures 
of ApoTmArgBP (orange) and HoloTmArgBP (magenta). In both 
panels, overlapped regions are 22-104 and 206-243 of lobe I. The 
arrows highlight the conformational changes occurring upon arginine 
binding. 

doi:1 0.1 371 /journal.pone.0096560.g008 

essentially related to TRAP receptors, have been reported 
[20,25,26,27], In this scenario, the observation that TmArgBP is 
able to form a stable dimer leads to question about the relevance of 
the oligomeric state of TmArgBP on its biological function. We 
checked whether the two subunits of the TmArgBP dimer could 
simultaneously bind the ABC cassette system. The interactions of 
TmArgBP with an ABC transporter was modeled by following the 
procedure adopted by Vahedi-Faridi et al. [28] for ArtJ and using 
the complex between molybdate/tungstate ABC transporter and 
its cognate SBP [29] as a model template. The overall shape of this 
speculative complex (Figure 9) is compatible with the simultaneous 
binding of TmArgBP swapping dimer with two independent ABC 
transporter modules. The analysis of this complex also provides 
some preliminary indications on regions of TmArgBP involved in 
the transporter recognition. Based on analogy with ArtJ, there are 
two main regions comprised of residues 42—51 in lobe I and 
residues 171-177 in lobe II that play a major role in this process. 
Of particular interest is the observation that Glul63 of Artf, which 
was found by mutagenesis analyses to be a key player in this 
recognition process [28], is conserved in TmArgBP (Glul71). 

The availability of the 3D model of TmArgBP also offers the 
possibility to relate the extraordinary thermostability of the protein 




Cytoplasm 



Figure 9. A model for ABC cassette bound to HoloTmArgBP 
swapping dimer. 

doi:10.1371/journal.pone.0096560.g009 

to specific features of its structure. We compared sequence and 
structural features of TmArgBP with arginine binding proteins 
isolated from mesophHic organisms with known 3D structures. In 
particular, we selected the arginine binding proteins AbpA isolated 
from Streptococcus pneumoniae (PDB code 4162) and STM4351 
isolated from Salmonella enterka (PDB code 2Y7I) [30]. It is 
commonly accepted that electrostatic interactions frequently play 
a major role in protein structure stabilization [31]. In this 
framework, we initially considered the overall content of polar 
versus charged amino acids, since it has been shown that in 
thermostable proteins there is an accumulation of charged residues 
associated with a reduction of polar residues [32,33]. Our analysis 
indicate that TmArgBP contains a higher percentage (30.8% vs 
25.2%) of charged residues (Lys, Arg, Glu, and Asp) and a lower 
amount (14.5% vs 21.0%) of polar residues (Asn, Ser, Gin, and 
Thr) when compared to STM435 1 . This trend is less clear in the 
comparison with AbpA, as TmArgBP maintains a lower amount 
of polar residues (14.5% vs 21.8%), but with similar percentages of 
charged residues (30.8% vs 29.5%). This analysis agrees with 
studies carried out on other hypertermophHic enzymes, that also 
exhibit a remarkable thermostability [32,33]. We next evaluated 
the occurrence and the frequency of specific interactions between 
charged residues that are commonly shown to play a role in the 
thermal stabilization of protein structure. In particular, our 
analysis found the occurrence of a larger number of salt bridges 
in TmArgBP (11 bridges) compared to STM4351 (5 bridges) and 
AbpA (8 bridges). These observations indicate that electrostatic 
interactions likely play a significant role in the stabilization of the 
protein. 

This study also elucidates the molecular recognition mechanism 
of TmArgBP for arginine at atomic level. The protein binds the 
substrate through an intricate network of interactions, which 
results in a high affinity of binding at room temperature (Kq in the 
nanomolar range). Although this may appear in contrast with 
previous reports [7], the analysis of the 3D structure reconciles all 
data. Indeed, SPR experiments indicated a much lower affinity 
(Kd of 17 |iM) of the protein for a rather complex arginine- 
containing peptide [9]. The lower affinity for this peptide can be 
easily explained by considering that the protein strongly grasps 
both charged amino and carboxyl ends of the aminoacid. Very 
recently, it has been reported that mutations of residues of the 
TmArgBP binding pocket, that were specifically designed to 
achieve a fluorescence variation upon substrate binding, lead to 
variants that bind arginine with micromolar affinities [7]. The 
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analysis of TniArgBP binding pocket suggests that the replace- 
ments of Gly94, Met95, Glnl 16, and Thrl46 with a bulkier Trp 
strongly reduces the afiinity for the substrate through a partial 
occupation of the binding site or by inducing a local destabilization 
of the structure (Figure S6 in File SI). 

Taken together our data represent a significant advancement 
for the design and the characterization of TmArgBP mutants 
that may be used for constructing arginine sensors. It is worth 
mentioning that the use of the wild-type protein is not suited for 
this purpose since no variation of Trp florescence is observed 
upon arginine binding. Consistently, the local environment of 
Trp243 does not change in the structures of the apo and the 
holo forms here described, despite the huge structural 
rearrangements of the protein structure upon arginine binding. 
In both cases, this residue is buried by the interactions 
established by the swapped C-terminal region with the main 
body of the protein. The availability of an accurate 3D model is 
a powerful tool for the design of new TmArgBP variants, by 
fine-tuning substrate afiinity and fluorescence signal, that are 
better suited for biotechnological applications. 

Materials and Methods 

Protein sample preparation 

The wild-type protein used in these studies includes the residues 
20-246 of the protein sequence (UniProt code Q9WZ62). This 
region corresponds to the entire protein deprived of the signal 
sequence for its perisplasmic exporting. TmArgBP was expressed 
by using E. colt Rosetta(DE3)2 ceUs foflowing the procedure 
described in Luchansky et al. [9] and Ruggiero et al. [34]. Since 
the expressed protein is obtained in the arginine bound state 
(HoloTmArgBP), its ligand-free form (ApoTmArgBP) was pre- 
pared using protocols previously reported [9,34]. 

Light scattering experiments 

Purified proteins (HoloTmArgBP and ApoTmArgBP) were 
analyzed by size-exclusion chromatography connected to a triple- 
angle light scattering detector equipped with a QELS module 
(quasi-elastic light scattering). Specifically, protein samples of 
500 |xg were loaded on a S200 10/30 column, equilibrated in 
20 mM Tris-HCl (pH 8.0) and 150 mM NaCl. A constant fiow 
rate of 0.5 ml/min was applied. Elution profiles were detected by 
a Shodex interferometric refractometer and analyzed using a 
miniDawn TREOS light scattering system (Wyatt Instrument 
Technology Corp.). Data were processed using the Astra 5.3.4.14 
software package. 

Denaturing and native gel electrophoresis analysis 

The stability of the TmArgBP dimer was assessed by SDS 
polyacrylamide gel electrophoresis (SDS-PAGE). The samples 
were prepared in 10% (v/v) SDS and denatured at 373K for 

5 minutes. 

The oligomerization state of protein samples was also analyzed 
using native polyacrylamide gel electrophoresis (N-PAGE). In this 
case, purified proteins were diluted in native-sample buffer 
(0.06 M TrisHCl, 2.5% (v/v) glycerol, 0.001% (w/v) Bromophe- 
nol Blue, pH 6.8) and loaded on 10% (v/v) N-PAGE. The 
electrophoresis was carried out for 2 hours at 25 mA. 

Isothermal titration calorimetry 

The interaction of ApoTmArgBP with arginine or glutamine 
was investigated at 298 K by isothermal titration calorimetry using 
a MicroCal ITC200 calorimeter (GEHelthcare, Milan) calibrated 
with standard electrical pulses. All solutions were degassed by 



stirring under vacuum before use. In these experiments, 18 
consecutive injections of 2 (xL aliquots of a 1.25 mM arginine 
solution were added to the calorimeter ceU (0.280 mL) containing 
0.05 mM of apoTniArgBPgo- 246 at intervals of 150 seconds. To 
minimize the contribution of heat of dilution to the measured heat 
change, protein and ligand solutions were prepared in the same 
bufier (50 mM TrisHCl, 150 mM NaCl, pH 8). In order to ensure 
proper mixing after each injection, a constant stirring speed of 
1000 rpm was maintained during the experiment. Data were 
analyzed using a 'one set of sites' binding model. 

Protein expression, purification and crystallization 

The failure to solve the crystal structure of TmArgBP by 
molecular replacement [34], prompted us to prepare a selenome- 
thionine derivative (Se-Met) of the protein. The Se-Met derivative 
of TmArgBP was expressed by using E. coli Rosetta(DE3)2 cells in 
IL of minimal media (M9) enriched with the following compo- 
nents: 0.4% (w/v) glucose, 1 mM MgSOi, 0.1 mM CaCla, 
50 ugL~' ampicillin, 33 ug L~' chloramphenicol, 100 ugL~' 
thiamine at 37°C. After reaching an ODgoo of 0.7, an aminoacid 
mix (50 mg L~' He, Leu and Val and 100 mg L~' of Phe, Thr, 
and Lys) was added to the bacterial culture. After equilibration, 
60 mg L ' of seleno-L-methionine were added and the induction 
was performed by adding 0.5 mM IPTG. The labeUed protein was 
purified as previously described [34]. The homogeneity of the 
protein was evaluated by SDS-PAGE analysis. The molecular 
mass of the purified protein was checked by mass spectrometry 
and no proteolysis of the protein was detected. The ligand-free 
form of TmArgBP was prepared using procedures previously 
reported [34]. 

Crystallization of both hgand-bound and ligand-free SeMet 
TmArgBP was performed at 293 K by hanging-drop vapour- 
diffusion methods. In previous studies, HoloTmArgBP was 
crystallized in the presence of PEG 10,000 as a precipitant, 
whereas ApoTmArgBP was crystallized by using PEG 3,350 [34]. 
However, a further screening/ optimization of crystallisation 
conditions was achieved for the Se-Met derivatives of both 
HoloTmArgBP and ApoTmArgBP, using the PEG/Ion screen 
(Hampton Research) and then the Additive screen (Additive 
Formulation, Hampton Research). New crystaUization conditions 
were found for both forms. The best crystals of HoloTmArgBP 
were obtained using a protein concentration of 25 mg mL^ ' and 
0.2 M Potassium acetate, 20% (w/v) Polyethylene glycol 3,350 
using LDAO as an additive [34]. These crystals diffracted to 2.49 
A at the XI 2 beamline, DESY, Hamburg (Table 1). Cr>'stals of the 
ligand-free TmArgBP were obtained with the same procedure 
adopted for the native protein. Best crystals grew in 16-20 mg 
mL"' protein solution and in 25% w/v PEG 3,350 and 0.1 M 
Sodium Acetate trihydrate (pH 4.6). These crystals diffracted to 
1.5 A and belonged to the C2 space group (Table 1). 

Data collection, processing and structure determination 

DiflFraction data for the Se-Met derivatives of holo and apo 
forms of TmArgBP were collected at the XI 2 synchrotron 
beamline, DORIS storage ring, DESY (Hamburg, Germany) at 
100 K. Cryoprotection of the crystals was achieved by a fast 
soaking in a solution containing ethylene glycol to a final 
concentration of 14% (v/v). Because of a large unit cell, the 
diffraction data of HoloTmArgBP were collected using a small 
rotation angle of 0.1°. To avoid the overlap of difiracted intensities 
for HoloTmArgBP the resolution of the coUected dataset was 
limited at 2.49 A, although spots were detectable at resolution as 
high as 2.0 A. The data sets of both forms were scaled and merged 
using the HKL2000 program package [35]. 
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Multi-wavelength anomalous dif&action (MAD) experiments 
were carried on Se-Met labelled crystals. For peak and inflection 
wavelength determination, fluorescence scans were recorded for 
both holo and apo TmArgBP crystals. For ApoTmArgBP, data 
sets were collected at three wavelengths (peak, inflection and 
remote), optimised for Se-Met. For HoloTmArgBP, diffraction 
data were collected only at the wavelength corresponding to the 
peak. Statistics of data collection are reported in Table 1. 

Structure determination and refinement 

The Auto-Rickshaw pipeline was adopted to determine both 
the structures of holo and apo forms [36], using the SAD 
(Single-wavelength Anomalous Dispersion) and MAD methods, 
respectively. The initial set of phases was improved by using 
solvent-flattening and phase extension methods. Manual model- 
ling was performed using Coot [37]. 

Crystallographic refinement of both structures was carried out 
against 95% of the measured data using the ccp4i program suite. 
The remaining 5% of the observed data, which was randomly 
selected, was used in Rfree calculations to monitor the progress of 
refinement. Non crystallographic restraints were applied in 
REFMAC [38] with medium restraints for main-chain atoms 
and loose restraints for side-chain atoms. Water molecules were 
incorporated into the structure in several rounds of successive 
refinement. For the apo form the entire construct sequence 
(residues 20-246) was modelled in the final electron density. Due 
to the lower resolution of data, the two C-terminal residues could 
be not modelled for the holo form. The basic stereochemistry of 
the model was checked by using the program PROCHECK. We 
evaluated the occurrence of some corr(;lations between geomet- 
rical parameters that are typically detected in highly accurate 
protein structures and are not biased by restraints in the 
crystallographic refinement. In particular, we checked th(- 
dependence on the v|/ angle (i) of the NC^C bond angle [39,40], 
(ii) of peptide bond planarity [41] and (iii) of the carbon carbonyl 
pyramidalization &c [39,40,42] (Figures S1-S4 in File SI). For the 
high resolution apo structure, the NC"CVv(/ and the peptide 
planarity versus \|/ correlations are clearly detected. Surprisingly, 
the dependence of the peptide planarit)' on the v|f angle was also 
detected in the structure of the holo form despite the relatively low 
resolution of the dataset. This observation may be explained by 
considering that the crystals of the holo form diffracted at a 
resolution beyond the resolution limit of the dataset and that the 
large unit cell prevented the collection at higher resolution (see 
above). The structures of the HoloTmArgBP and ApoTmArgBP 
have been deposited in the PDB with the codes 4PSH and 4PRS, 
respectively. 

Notation 

Throughout the text, the residue numbering refers to the full- 
length protein 1-246. 

Supporting Information 

FUe SI Contains the following files: Figure SI. Distribution of 
NCaC angles in residues located in P-sheets (A) and a-helices (B) 
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performed by using the structure of HoloTmArgBP as template. 
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chain, directiy affects the binding site. On the other hand, the 
replacement of Gin 116 and Thrl46 likely induces a local 
destabUization of the protein structure. 
(PDF) 
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